Discriminative Learning of Feature Functions of Generative Type in Speech Translation
نویسندگان
چکیده
The speech translation (ST) problem can be formulated as a log-linear model with multiple features that capture different levels of dependency between the input voice observation and the output translations. However, while the log-linear model itself is of discriminative nature, many of the feature functions are derived from generative models, which are usually estimated by conventional maximum likelihood estimation. In this paper, we first present the formulation of the ST problem as a log-linear model with a plurality of feature functions. We then describe a general discriminative learning framework for training these generative features based on a technique called growth transformation (GT). The proposed approach is evaluated on a spoken language translation benchmark test of IWSLT. Our experimental results show that the proposed method leads to significant improvement of translation quality. Fast and stable convergence can also be achieved by the proposed method. 1. Electronic Submission Speech translation (ST) takes the source speech signal as input and produces as output the translated text of that utterance in another language. It can be viewed as automatic speech recognition (ASR) and machine translation (MT) in tandem. Like many other machine learning problems, the speech translation (ST) problem can be modeled by a log-linear model with multiple features that capture different dependencies between the input voice observation and the output translations. Although the log-linear model itself is a discriminative model, many of the feature functions, such as scores of ASR outputs, are still derived from generative models. Further, these features are usually trained by conventional maximum likelihood estimation. In this paper, we propose a general framework of discriminative training for these generative features based on a technique called growth transformation (GT). The proposed approach is evaluated on a spoken language translation benchmark test called IWSLT. Our experimental results show that the proposed method leads to significant translation performance improvement. It is also shown that fast and stable convergence can be achieved by the proposed GT based optimization method.
منابع مشابه
Discriminative Models for Semi-Supervised Natural Language Learning
An interesting question surrounding semisupervised learning for NLP is: should we use discriminative models or generative models? Despite the fact that generative models have been frequently employed in a semi-supervised setting since the early days of the statistical revolution in NLP, we advocate the use of discriminative models. The ability of discriminative models to handle complex, high-di...
متن کاملFEAST at Play: Feature ExtrAction using Score function Tensors
Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we build upon a novel framework called FEAST(Feature ExtrAction using Score function Tensors) which incorporates generative models for discriminative learning. FEAST considers a novel class of matrix and tensor-valued featur...
متن کاملScore Function Features for Discriminative Learning: Matrix and Tensor Frameworks
Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we consider a novel class of matrix and tensor-valued features, which can be pre-trained using unlabeled samples. We present efficient algorithms for extracting discriminative information, given these pre-trained features an...
متن کاملScore Function Features for Discriminative Learning: Matrix and Tensor Framework
Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we consider a novel class of matrix and tensor-valued features, which can be pre-trained using unlabeled samples. We present efficient algorithms for extracting discriminative information, given these pre-trained features an...
متن کاملScore Function Features for Discriminative Learning
Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we consider a novel class of matrix and tensor-valued features, which can be pre-trained using unlabeled samples. We present efficient algorithms for extracting discriminative information, given these pre-trained features an...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013